Is Hypothesis Testing Useful for Subcategorization Acquisition?
نویسندگان
چکیده
Statistical ltering is often used to remove noise from automatically acquired subcat-egorization frames. In this paper, we compare three diierent approaches to ltering out spurious hypotheses. Two hypothesis tests perform poorly, compared to ltering frames on the basis of relative frequency. We discuss reasons for this and consider directions for future research.
منابع مشابه
Bengali Verb Subcategorization Frame Acquisition - A Baseline Model
Acquisition of verb subcategorization frames is important as verbs generally take different types of relevant arguments associated with each phrase in a sentence in comparison to other parts of speech categories. This paper presents the acquisition of different subcategorization frames for a Bengali verb Kara (do). It generates compound verbs in Bengali when combined with various noun phrases. ...
متن کاملSubcategorization acquisition
Manual development of large subcategorised lexicons has proved difficult because predicates change behaviour between sublanguages, domains and over time. Yet access to a comprehensive subcategorization lexicon is vital for successful parsing capable of recovering predicate-argument relations, and probabilistic parsers would greatly benefit from accurate information concerning the relative likel...
متن کاملStatistical Filtering and Subcategorization Frame Acquisition
Research "into the automatic acquisition of subcategorization frames (SCFS) from corpora is starting to produce large-scale computational lexicons which include valuable frequency information. However, the accuracy of the resulting lexicons shows room for improvement. One significant source of error lies in the statistical filtering used by some researchers to remove noise from automatically ac...
متن کاملCan Subcategorization Help a Statistical Dependency Parser?
Today there is a relatively large body of work on automatic acquisition of lexicosyntactical preferences (subcategorization) from corpora. Various techniques have been developed that not only produce machinereadable subcategorization dictionaries but also they are capable of weighing the various subcategorization frames probabilistically. Clearly there should be a potential to use such weighted...
متن کاملTwo-Fold Filtering for Chinese Subcategorization Acquisition with Diathesis Alternations Used as Heuristic Information
Automatically acquired lexicons with subcategorization information have been shown to be accurate and useful for some purposes, but their accuracy still shows room for improvement and their usefulness in many applications remains to be investigated. This paper proposes a two-fold filtering method, which in experiments improved the performance of a Chinese acquisition system remarkably, with an ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007